Group O HIV-1, fragments of such viruses, and uses thereof

ABSTRACT

PCT No. PCT/FR96/00294 Sec. 371 Date Dec. 1, 1997 Sec. 102(e) Date Dec. 1, 1997 PCT Filed Feb. 26, 1996 PCT Pub. No. WO96/27013 PCT Pub. Date Sep. 6, 1996Group HIV-1 retrovirus strains, particularly the strains known as BCF02, BCF01, BCF06, BCF07, BCF08, BCF11, BCF03, BCF09, BCF12, BCF13 and BCF14, fragments of said retroviruses, and the uses thereof as a diagnostic reagent and as an immunogen, are disclosed.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to retrovirus strains of the HIV-1 group, group O, and in particular the strains called BCF02 (ESS), BCF01 (FAN), BCF06 (LOB), BCF07 (MAN), BCF08B (NKO), BCF11 (NAN) and BCF03 (POC), to fragments of the said retroviruses and to their applications as diagnostic reagent and as immunogenic agent.

2. Description of the Background

Two distinct types of HIV (human immunodeficiency virus: HIV-1 and HIV-2) have been described and are the agents responsible for AIDS. Analysis of their nucleic acid sequence has made it possible to identify various subtypes of HIV-1, although no correlation could be established between variability and pathogenicity. Similarly, HIV-2 exhibits a greater genetic and biological diversity than that previously envisaged.

Analysis of nucleotide fragments of various HIV-1 isolates has shown the existence, through the analysis of the env gene, of at least 7 different subtypes, called A to G (MYERS G. et al., Human retroviruses and AIDS, 1993, Los Alamos Nat. Lab.).

More recently, two other isolates, considered to be considerably more distant from the other 7 subtypes, that is to say whose sequence homology is the most distant from that of the reference HIV-1 strains, have also been isolated: HIV-1ANT70 and HIV-1MVP5180, obtained from Cameroonian patients, and have been attached to a new HIV-1 group, group O, as opposed to group M corresponding to the 7 abovementioned A-G subtypes, taking into account their genomic organization (5' LTR Gag Pol Vif Vpu Vpr Tat Rev Env Nef LTR 3'), (Patent Application WO 89/12094, European Patent Application No. 0,591,914, GuURTLER L. G. et al., J. Virol., 1994, 68, 1581-85).

Analysis of the DNA sequences has shown 65-70% similarity with HIV-1 and 56% with HIV-2.

By using a competition immunoblotting method, using a peptide V3 from MVP5180, a 7-8% prevalence is found in Yaounde. This prevalence might be underestimated since the V3 loop is known, in all the HIV-1 subtypes, to be a highly variable region. Molecular studies also indicate the presence of group O virus in Gabon, France, Spain and Germany.

Ongoing serological studies reveal group O HIV-1 infections in Nigeria, Niger and Senegal.

Knowledge of these various groups and subtypes is particularly important for developing:

reagents for screening for HIV infections which are sufficiently sensitive and specific, that is to say which do not lead to false-negative or false-positive results; and

compositions which protect against all existing subtypes, including the entire group O viruses.

Indeed, it has been shown, in particular, that some detection reagents were not sufficiently sensitive and did not always make it possible to detect group O HIV-1 infections (LOUSSERT-AJAKA I. et al., Lancet, 1994, 343, 1393-94), which has led to the withdrawal of three screening kits and to the declassification of two others on the French market.

The results show that the group O viruses are very distant from the group M HIV-1 subtypes. These results indicate that the HIV-1 viruses ought to be classified in two different groups, namely: HIV-1 M and HIV-1 O. The group O viruses appear to be capable of being transmitted by horizontal and vertical routes, leading to a very wide distribution of this infection. The pathogenicity of these viruses is under study. The divergence of these viruses should be taken into account in the sensitivity of diagnostic tests and in the development of vaccines.

SUMMARY OF THE INVENTION

Consequently, the Applicant set itself the objective of providing fragments, derived from selected group O HIV-1 strains, capable of allowing both detection of the entire group O HIV-1 viruses and specific intra-group O differentiation and which are also capable of inducing protection against the entire HIV-1 subtypes, including group O, which fragments make it possible to avoid obtaining false-negative or false-positive results; to do this, the inventors selected a set of strains and sequences, derived essentially from a region of the env gene, in particular at the level of a hypervariable fragment situated in the V3 loop (C2V3) or at the level of gp41 and of a region of the gag gene.

The subject of the present invention is group O HIV-1 strains exhibiting the morphological and immunological characteristics of one of the retroviruses deposited at the Collection Nationale de Cultures de Microorganismes held by Institut Pasteur under the numbers I-1544 (called BCF02 (ESS)), I-1543 (called BCF01 (FAN)), I-1546 (called BCF07 (MAN)), I-1547 (called BCF08 (NKO)), I-1545 (called BCF03 (POC)), on the date of Feb. 24, 1995.

The subject of the present invention is also a nucleic acid. fragment, characterized in that its nucleotide sequence is chosen from those which are contained in one of the nucleotide sequences included in the env or gag genes of the group O HIV-1 strains and its variants and comprises:

either one of the following sequences, included in the C2V3-env gene fragment (hypervariable loop of gp120): SEQ ID No. 1 (BCF02 (ESS)), SEQ ID No. 2 (BCF08 (NKO)), SEQ ID No. 3 (BCF03 (POC)), SEQ ID No. 4 (BCF06 (LOB)), SEQ ID No. 5 (BCF07 (MAN)), SEQ ID No. 6 (BCF01 (FAN)), SEQ ID No. 7 (BCF11 (NAN)), SEQ ID No. 50 (BCF09), SEQ ID No. 51 (BCF12), SEQ ID No. 52 (BCF13), SEQ ID No. 53 (BCF14),

or one of the following sequences, included in the gp41 fragment of the env gene: SEQ ID No. 8 (BCF02 (ESS)), SEQ ID No. 9 (BCF08 (NKO)), SEQ ID No. 10 (BCF03 (POC)), SEQ ID No. 11 (BCF06 (LOB)), SEQ ID No. 12 (BCF07 (MAN)), SEQ ID No. 13 (BCF01 (FAN)), SEQ ID No. 14 (BCF11 (NAN)), SEQ ID No. 54 (BCF09), SEQ ID No. 55 (BCF12), SEQ ID No. 56 (BCF13), SEQ ID No. 57 (BCF14),

or one of the following sequences, included in the gag gene: SEQ ID No. 15 (BCF02 (ESS)), SEQ ID No. 16 (BCF08 (NKO)), SEQ ID No. 17 (BCF03 (POC)), SEQ ID No. 18 (BCF05 (LOB)), SEQ ID No. 19 (BCF07 (MAN)), SEQ ID No. 20 (BCF01 (FAN)), SEQ ID No. 21 (BCF11 (NAN)), SEQ ID No. 58 (BCF09), SEQ ID No. 59 (BCF12), SEQ ID No. 60 (BCF13), SEQ ID No. 61 (BCF14),

or, if the sequence is not identical to one of the above nucleotide sequences, or is not complementary to one of these sequences, is nonetheless capable of hybridizing with a nucleic sequence derived from a group O HIV-1 virus.

DETAILED DESCRIPTION OF THE INVENTION

For the purposes of the present invention, nucleic sequence is understood to mean the sequences, as specified above and their complementary sequences, as well as the sequences containing them.

Such sequences find application both in the specific identification of a group O HIV-1, as diagnostic reagent, alone or in a pool with other reagents, for the identification of any HIV-1 or alternatively, depending on the cases, as reagent for intra-group O differentiation.

These sequences may be used in particular in diagnostic tests comprising either a direct hybridization with the viral sequence to be detected, or an amplification of the said viral sequence, using, as primers, an oligonucleotide, included in any one of the above sequences and in particular one of the following sequences:

sequences gag

GAG/5'CAM or G5: CAGGGACAAATGGTACATCA (positions 1250-1269) (SEQ ID No. 74)

GAG/3'CAM or G3: AGTAGCTTGCTCAGCTCTTAAT (positions 1768-1747) (SEQ ID No. 75)

sequences gp41

SEQ ID No. 22 (gp41/5'CAM-1): AGRGAAAAAGAGCAGTAGGAT (positions 7800-7821)

SEQ ID No. 23 (gp41/5'CAM-2): TCTAAGTGCAGCAGGTAGCACTAT (positions 7843-7866)

SEQ ID No. 24 (gp41/3'CAM-2): CTAAGTTGCTCAAGAGTGGTA (positions 8594-8573)

SEQ ID No. 25 (gp41/3'CAM-1): GTTGCTCAAGAGGTGGTAAGT (positions 8590-8570)

sequences C2V3:

C2V3/5'CAM or V3L5: TRGTTACTTGTACACATGGCAT (positions 6991-7012) (SEQ ID No. 76)

C2V3/3'CAM or V3L3: ACAATAAAAGAATTCTCCATGACAGT (positions 7421-7396) (SEQ ID No. 77).

The abovementioned positions correspond to those of the Ant70 sequence (Myers, Korber et al., cited above).

The subject of the present invention is also group O HIV-1 strains, characterized in that they comprise at least one of the sequences selected from the group consisting of the sequences SEQ ID No. 1 to SEQ ID No. 7 or SEQ ID No. 50 to SEQ ID No. 53, at least one of the sequences selected from the group consisting of the sequences SEQ ID No. 8 to SEQ ID No. 14 or SEQ ID No. 54 to SEQ ID No. 57 and at least one of the sequences selected from the group consisting of the sequences SEQ ID No. 15 to SEQ ID No. 21 or SEQ ID No. 58 to SEQ ID No. 61.

According to an advantageous embodiment of the said strain, it comprises the sequences SEQ ID No. 50, SEQ ID No. 54, SEQ ID No. 58; this strain has been called BCF09.

According to another advantageous embodiment of the said strain, it comprises the sequences SEQ ID No. 51, SEQ ID No. 55, SEQ ID No. 59; this strain has been called BCF12.

According to yet another advantageous embodiment of the said strain, it comprises the sequences SEQ ID No. 52, SEQ ID No. 56, SEQ ID No. 60; this strain has been called BCF13.

According to yet another advantageous embodiment of the said strain, it comprises the sequences SEQ ID No. 53, SEQ ID No. 57, SEQ ID No. 61; this strain has been called BCF14.

According to another advantageous embodiment of the said strain, it comprises the sequences SEQ ID No. 7, SEQ ID No. 14, SEQ ID No. 21; this strain has been called BCF11.

According to yet another advantageous embodiment of the said strain, it comprises the sequences SEQ ID No. 4, SEQ ID No. 11, SEQ ID No. 18; this strain has been called BCF06.

The subject of the invention is also the use of the sequences described above for carrying out a process of hybridization or gene amplification of nucleic sequences of the HIV-1 type, these processes being applicable to the in vitro diagnosis of the potential infection of an individual with an HIV-1 type virus, including group O.

This in vitro diagnostic method is carried out using a biological sample (serum, circulating lymphocytes) and comprises:

a step of extracting the nucleic acid to be detected, belonging to the genome of the HIV-1 type virus, which may be present in the biological sample and, where appropriate, a step of treating the nucleic acid with the aid of a reverse transcriptase, if the latter is in RNA form,

at least one cycle comprising the steps of denaturation of the nucleic acid, annealing with at least one sequence in accordance with the invention and extension of the hybrid formed, in the presence of the appropriate reagents (polymerizing agent such as DNA polymerase and dNTP), and

a step of detecting the possible presence of the nucleic acid belonging to the genome of a group O HIV-1 type virus (group specificity).

The subject of the invention is also a peptide characterized in that it is expressed by a nucleotide sequence as defined above.

Among these peptides, there may be mentioned in particular:

those expressed by the C2V3-env gene fragment in accordance with the invention: SEQ ID No. 26 (BCF02 (ESS)), SEQ ID No. 27 (BCF01 (FAN)), SEQ ID No. 28 (BCF01 (FAN)), SEQ ID No. 29 (BCF06 (LOB)), SEQ ID No. 30 (BCF07 (MAN)), SEQ ID No. 31 (BCF11 (NAN)), SEQ ID No. 32 (BCF08 (NKO)), SEQ ID No. 33 (BCF08 (NKO)), SEQ ID No. 34 (BCF03 (POC)), SEQ ID No. 35 (BCF03 (POC)), SEQ ID No. 62 (BCF09), SEQ ID No. 63 (BCF12), SEQ ID No. 64 (BCF13), SEQ ID No. 65 (BCF14),

those expressed by the gp41env gene fragment in accordance with the invention: SEQ ID No. 36 (BCF02 (ESS)), SEQ ID No. 37 (BCF01 (FAN)), SEQ ID No. 38 (BCF06 (LOB)), SEQ ID No. 39 (BCF07 (MAN)), SEQ ID No. 40 (BCF08 (NKO)), SEQ ID No. 41 (BCF03 (POC)), SEQ ID No. 42 (BCF11 (NAN)), SEQ ID No. 66 (BCF09), SEQ ID No. 67 (BCF12), SEQ ID No. 68 (BCF13), SEQ ID No. 69 (BCF14),

those expressed by the gag gene fragment in accordance with the invention: SEQ ID No. 43 (BCF02 (ESS)), SEQ ID No. 44 (BCF01 (FAN)), SEQ ID No. 45 (BCF06 (LOB)), SEQ ID No. 46 (BCF07 MAN)), SEQ ID No. 47 (BCF11 (NAN)), SEQ ID No. 48 (BCF08 (NKO)), SEQ ID No. 49 (BCF03 (POC)), SEQ ID No. 70 (BCF09), SEQ ID No. 71 (BCF12), SEQ ID No. 72 (BCF13), SEQ ID No. 73 (BCF14).

The subject of the invention is also immunogenic compositions comprising one or more products of translation of the nucleotide sequences according to the invention or a fragment thereof and/or at least one of the peptides as defined above.

The subject of the invention is also the antibodies directed against one or more of the peptides described above and their use for carrying out methods of in vitro diagnosis of the infection of an individual with an HIV-1 type virus, according to processes known to persons skilled in the art.

By way of illustration, such an in vitro diagnostic method according to the invention comprises bringing a biological sample collected from a patient into contact with antibodies according to the invention, and detecting, with the aid of any appropriate process, in particular with the aid of anti-labelled immunoglobulins, the immunological complexes formed between the antigens of the HIV-1 type viruses which may be present in the biological sample and the said antibodies.

The subject of the present invention is also a process for screening and typing group O HIV-1, characterized in that it comprises bringing any of the nucleotide fragments in accordance with the invention into contact with the nucleic acid of the virus to be typed and detecting the hybrid formed.

In addition to the preceding arrangements, the invention further comprises other arrangements, which will emerge from the description which follows, which refers to examples for carrying out the process which is the subject of the present invention as well as to the accompanying drawings, in which:

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 represents the compared sequences of amino acids expressed by the env C2V3 gene fragment;

FIG. 2 represents the compared sequences of amino acids expressed by the gag gene fragment;

FIG. 3 illustrates the results obtained with the 7 strains BCF02 (ESS), BCF01 (FAN), BCF07 (MAN), BCF11 (NAN), BCF08 (NKO), BCF03 (POC) and BCF06 (LOB) on agarose gel, of a PCR carried out with the abovementioned C2V3 primers;

FIG. 4 illustrates the results obtained with the 7 strains BCF02 (ESS), BCF01 (FAN), BCF07 (MAN), BCF11 (NAN), BCF08 (NKO), BCF03 (POC) and BCF06 (LOB) on agarose gel, of a PCR carried out with the abovementioned Gag primers;

FIG. 5 represents the markers used in FIGS. 3 and 4;

FIG. 6 illustrates the reactivity of the sera corresponding to the strains according to the invention, relative to the phenetic organization of these variants;

FIG. 7 illustrates a phylogenetic analysis based on the gag region.

It should be understood, however, that these examples are given solely by way of illustration of the subject of the invention and do not in any way constitute a limitation thereto.

EXAMPLE 1 Production of the Sequences in Accordance with the Invention

Patients

Seven Cameroonian patients, consulting or hospitalized in Paris hospitals are included in this study.

At the time of diagnosis of the seropositive state, four patients are at the AIDS stage (CDC IVA n=1, CDC IVC1 n=3) and 3 patients are asymptomatic (CDC LI). The age group varies between 22 and 44 years for the symptomatic patients and it ranges between 22 and 68 years for the asymptomatic patients.

Four patients have a CD4+ level<200×10⁶ /μl (10, 19, 91, 97), 2 patients have CD4+ levels of between 200 and 500×10⁶ /μl (384, 420) and one patient has CD4+ levels>500×10⁶ /μl (575).

Cultures

Ten to twenty ml of total blood for these 7 patients were collected over lithium heparin. The peripheral blood mononuclear cells (PBMC) are isolated on a Ficoll-Hypaque centrifugation gradient (Pharmacia). The cellular pellet obtained is washed twice with RPMI 1640 and resuspended in culture medium at a concentration of 2×10⁶ cells/ml. This culture medium contains RPMI 1640 supplemented with 20% foetal calf serum, 10% human interleukin, 150 μg/ml of streptomycin, 250 units/ml of penicillin G and 5% L-glutamin. One million of the patient's cells is cocultured in duplicate with one million cells from donors stimulated with PHA in 24-well culture plates (Costar). The culture medium is changed twice per week, and 3×10⁵ donor cells are added to each well on d7, d14 and d21.

The viral replication is monitored in the culture supernatants for 28 days, simultaneously by a microtechnique (measurement of the reverse transcriptase activity) and the detection of the p24 antigen (ELAVIA® p24 Ag, Sanofi-Diagnostic Pasteur). The positive-culture supernatants are collected and kept at -80° C. for the reinoculations.

Preparation of the DNA

A PCR is carried out using the DNA of fresh lymphocytes extracted from a patient or of lymphocytes, after 6 days of coculture or plasma after RT. Using 100 μl of a reaction mixture, containing Tris-HCl 10 mmol/l pH 8.3, KCl 50 mmol/l, MgCl₂ 2 mmol/l, 0.2 mmol/l of each dNTP, 40 pmol of each primer, 2.5 U of Taq polymerase (Perkin Elmer Cetus, St Quentin Yvelines, France) and 1 μg of cellular DNA. The primers used are those specified above, namely:

sequences gag:

GAG/5'CAM: CAGGGACAAATGGTACATCA (positions 1250-1269) (SEQ ID No. 74); GAG/3'CAM: AGTAGCTTGCTCAGCTCTTAAT (positions 1768-1747) (SEQ ID No. 75)

sequences gp41

gp41/5'CAM-1: AGRGAAAAAAGAGCAGTAGGAT (positions 7800-7821) (SEQ ID No. 22)

gp41/5'CAM-2: TCTAAGTGCAGCAGGTAGCACTAT (positions 7843-7866) (SEQ ID No. 23)

gp41/3'CAM-2: CTAAGTTGCTCAAGAGTGGTA (positions 8594-8573) (SEQ ID No. 24)

gp41/3'CAM-1: GTTGCTCAAGAGGTGGTAAGT (positions 8590-8570) (SEQ ID No. 25)

or alternatively one of the following sequences: SEQ ID No. 76 and SEQ ID No. 77 for the env region and SEQ ID No. 74 and SEQ ID No. 75 for the gag region, corresponding respectively to nucleotides 6991-7012, 7421-7396 and 1250-1269, 1768-1747 of the HIV1^(Ant70) sequence.

The samples are subjected to 40 amplification cycles, each cycle comprising the following three steps: denaturation at 94° C. for one minute, annealing of the primers at 50° C. for the gag sequence and at 55° C. for the env sequence, for one minute and extension at 72° C. for one minute. During the first cycle, the denaturation is carried out for 4 minutes and for the last cycle, the extension is carried out for 5 minutes. The amplified products are subjected to enzymatic digestion (Xho1, EcoR1), purified and cloned into a vector M13mp18, digested with the restriction enzymes Sal1 and EcoR1.

For each patient, between 3 and 4 clones are sequenced (Applied 373A sequencer), in a 406-bp region of the gag gene, in a 320-bp region of the env gene included in the V3 region and at the level of the region encoding gp41.

EXAMPLE 2 Immunodetection of a Group O HIV-1.

ELISA tests are carried out in microtitre plates (Falcon 3912, microtest III®, Becton Dickinson).

The wells are covered with 100 μl of one of the V3 peptides defined above: BCF08 (NKO): R T I Q E I H S G P M A W Y S L G L K R N T T V R (SEQ ID No. 33); BCF01 (FAN): R S V Q E M K I G P L S W Y S M G L A A N S S I K (SEQ ID No. 28); BCF03 (POC): R I K Q I G I G P M S V Y S G S L A D L G N N N (SEQ ID No. 35), diluted to 10 μg.ml⁻¹ in a 0.05 M carbonate buffer pH 9.6 by overnight incubation, at +4° C.

The plates are then washed three times with a PBS buffer containing Tween® 20 (Prolabo, France), (PBS-Tween®) 0.1%, and then they are saturated with PBS, which is supplemented with 1% milk (Gloria Co., France), for 1 h at 37° C.

Sera (1/100), diluted in a PBS-milk mixture containing Tween® 20 at 0.1% (PBS-milk-Tween®), are incubated for 2 h at 37° C.

After three washes, anti-human polyvalent antibodies, conjugated with alkaline phosphatase (Sigma), diluted 1:10,000 in PBS-milk-Tween, are added and incubated for 1 h at 37° C. After the last wash, the coloured reaction is developed at 37° C., with an alkaline phosphatase substrate, p-nitrophenylphosphate (Sigma), diluted in a 0.05 M carbonate buffer pH 9.5, containing 2 mM MgCl₂, in order to obtain a concentration of 1 mg.ml⁻¹. The absorbance, measured at 405 nm (A₄₀₅) is recorded with an apparatus (MR 5000, Dynatech). A cut-off is determined for each serum tested and corresponds to three times the A₄₀₅ value obtained with the peptide E19S, derived from Plasmodium malariae.

Table 1 below shows the results obtained.

The consensus and FR 15-1 sequences correspond to those obtained from the B subtype, found in France.

    __________________________________________________________________________            Seronegative and seropositive African patients (group M)                Peptides/Serum                                                                        8189                                                                              8362                                                                               8364                                                                              8365                                                                               8366                                                                              8370                                                                               8429                                                                              8499                                                                               8503                                                                              8116                                                                               8122                                                                              8186                                                                               3171                          __________________________________________________________________________     HIV Serology                                                                          -  -   -  -   -  -   +  +   +  +   +  +   +                             Neg    0.129                                                                             0.319                                                                              0.158                                                                             0.158                                                                              0.328                                                                             0.152                                                                              0.17                                                                              0.157                                                                              0.28                                                                              0.201                                                                              0.14                                                                              0.171                                                                              0.17                          CONS                        1.46   0.65                                                                              2.52                                                                               1.72                                                                              1.92                              FR15-1                      2.64   1.04                                                                              2.77                                                                               2.3                                                                               2.86                              MVP5180                                                                        ANT70                                                                          BCF08 (NKO)                                                                    BCF01 (FAN)                                                                    BCF03 (POC)                                                                    __________________________________________________________________________                      Group O patients                                                        Peptides/Serum                                                                        BCF06                                                                              BCF07                                                                              BCF01                                                                              BCF03                                                                              BCF09                                                                              VAU BCF08                                                                              Neg Blank                         __________________________________________________________________________               HIV Serology                                                                              + + +   +   +   +   +   -                                           Neg        0.175     0.299                                                                    0.197                                                                              0.257                                                                              0.146                                                                              0.181                                                                              0.327                                                                              0.193                                                                              0.127                                   CONS                           0.72                                            FR15-1                         0.67                                            MVP5180                                                                                   0.76                                                                ANT70      >3  1.83        0.5                                                 BCF08 (NKO)                                                                               1.76.68                                                                            1.16                                                            BCF01 (FAN)                                                                               2.28.08                                                                            1.51    0.68                                                                               0.59                                                BCF03 (POC)                                                          __________________________________________________________________________      Only the results greater than twice the background have been kept        

    Sequence of the peptides                                                       Consensus                                                                             NTRKSINIGPGRAFYATGEII                                                   FR15-1 NTRKGINIGPGRAFYTTGEII                                                   MVP5180                                                                               REVQDIYTGPMRWRSMTLKRSNNTS                                               ANT70  RDIQEMRIGPMAWYSMGIGGTAGNS                                               BCF08 (NKO)                                                                           RTIQEIHSGPMAWYSLGLKRNTTVR                                               BCF01 (FAN)                                                                           RSVQEMKIGPLSWYSMGLAANSSIK                                               BCF03 (POC)                                                                           RIKQIGIGPMSVYSGSLADLGNNN                                           

This Table I shows the specificity of the peptides according to the invention.

EXAMPLE 3 Serotype, Phenotypes and Genotype characteristics of the Strains According to the Invention

These characteristics are illustrated in Tables II and III below.

                                      TABLE II                                     __________________________________________________________________________     SEROTYPE           PHENOTYPE                                                   Antigen E1A   V3   Sensitivity to antiviral agents                             HIV-1         ANT70                                                                               TIBO  DELA-                                                                               SAQUINA-                                         1     2 3 4 5 6  7 RO82913                                                                              VERDINE                                                                             VIR                                              __________________________________________________________________________     BCF01                                                                              - + + + + +  + R     R    R                                                BCF02                                                                              - - - - - +  + R     R    S                                                BCF03                                                                              + - + + - -  - R     R    S                                                BCF06                                                                              + - + + + ±                                                                              + R     R    S                                                BCF07                                                                              + - - + + +  + R     R    S                                                BCF08                                                                              + - + + + -  ±                                                                             R     R    S                                                BCF11                                                                              + - + + + +  ±                                                                             R     S    R                                                VAU + - + + + +  + S     S    S                                                __________________________________________________________________________      1 Test WELLCOME competition HIV1                                               2 Test CLONATEC indirect HIV1/2                                                3 Test ABBOTT 3rd                                                              4 Test WELLCOME 3rd                                                            5 Test BOEHRINGER                                                              6 EIA ANT70 V3                                                                 7 Dot blot ANT70 V3 INNOLIA                                                    R: Resistant                                                                   S: Sensitive                                                             

                  TABLE III                                                        ______________________________________                                         PHENOTYPE     GENOTYPE                                                         on MT2 cells  PCR                                                              SI/NSI    p24     ROCHE   V3 LOOP SUMMIT                                       ______________________________________                                         BCF01 NSI     +       -     K   I   G   P   L   S   W                          BCF02 NSI     +       -     R   I   G   P   M   A   W                          BCF03 SI      +       -     G   I   G   P   M   S   V                          BCF06 NSI     +       -     A   T   G   P       R   W                          BCF07 NSI     +       -     K   I   G   P   M   A   W                          BCF08 NSI     +       -     H   S   G   P   M   A   W                          BCF11 NSI/NR  -       -     G   I   G   P   L   S   W                          VAU   Not     Not     Not   M   A   G   P   M   A   W                                tested  tested  tested                                                   ______________________________________                                          SI: Induction of syncytia on MT2 cells                                         NSI: No induction of syncytia on MT2 cells                                     NR: Nonreplicative                                                             p24: Production (+) or absence (-) of p24 antigen on cultures of MT2 cell

Serotype analysis on a set of commercialized tests (Table II) (with the exception of Test No. 6, EIA Ag V3 ANT70, non-commercialized research product) reveals the great diversity of antibody response towards the subtype B antigens which dominate in the west and relative to the antigen of the V3 loop of the ANT70 strain. The sera, corresponding to each of the isolates according to the invention, exhibit a unique and characteristic profile. BCF 01 is the only one positive on Test 2 (Clonatec®). BCF 02 is completely negative on the subtype B antigens. BCF 03, conversely, reacts with subtype B but is negative on the ANT70 antigens. BCF 07 is the only one negative on Tests No. 2 (Clonatec) and No. 3 (Abbott 3rd), but reactive on all the other tests with group M antigen. BCF 08 is strictly negative on the EIA V3 ANT70 test but, like BCF 11, is weakly reactive on this antigen by Test 7 (InnoLIA). By comparison, the serum corresponding to the VAU strain, which has been molecularly characterized by CHARNEAU et al. (Virology, 1994, 205, 247-253), is positive on all these antigens with the exception of Test 2 (Clonatec®). This diversity in antibody response should be interpreted as a reflection of the antigenic diversity of these strains. A link with immunodepression is excluded, the patients BCF 07, 08 and 11 being completely asymptomatic with a CD4 number>400/ml. FIG. 6 indicates the reactivity of the sera corresponding to the strains according to the invention relative to the phenetic organization of these variants.

The phenotype characterization of the isolates according to the invention shows the importance and the need to have a large number of sensitive reagents. All are naturally resistant to the molecule Tibo Ro82913 (Table III) as already reported for HIV-2. An absence of an in vitro growth-inhibiting activity outside a treatment with Ro82913 has never been reported for HIV-1 before. In contrast, the VAU strain is perfectly sensitive to Ro82913.

The strains are also resistant to another non-nucleoside inhibitor, Delaverdine, with the exception of the BCF 11 strain, which is sensitive.

Conversely, this strain is resistant to Saquinavir, an anti-protease, whereas the other strains are sensitive to it.

This diversity in response to the anti-retroviral agents reinforces the notion of a high dispersion right inside the Cameroon variants.

Furthermore, the results of syncytia formation on the continuous line MT2 complicates any classification since there is no correlation with the preceding serotype or genotype results and no relationship with the clinical stage. These strains do not induce syncytial formation whereas the detection of p24 antigen in the supernatants confirms the replication on this line.

Finally, at the genotype level, all the amplifications of the genomes of the strains according to the invention are negative by PCR using a kit marketed by Roche Diagnostics, which uses primers and probes corresponding to a conserved region of the HIV-1 Gag gene; a negative result was also possible for this test in the HIV-1 A subtypes (LOUSSERT-AJAKA et al., 1995, Lancet, 346, 912-913; LOUSSERT-AJAKA et al., 1995, 346, 1489). This negative result within variant isolates from Cameroon but also within the A subtypes reinforces this notion of high HIV-1 variability and shows the need for the detection of the HIV-1 nucleic acids, to use a large panel of sequences and in particular those according to the present invention.

FIGS. 6 and 7 correspond to a phylogenetic analysis based on the gag region.

The phylogeny of the gag genes confirms the dispersion of these variants and the difficulty in regrouping them either into subtype, or even into an exclusion group and shows the need for a selection of variants exhibiting particular phenetic and genetic characters and whose corresponding sera have characteristic reactivity and non-reactivity profiles capable of allowing the detection of HIV-1 in sera previously considered to be false-negative.

As evident from the above, the invention is not at all limited to its embodiment, implementation and application which have just been defined more explicitly; it embraces, on the contrary, all the variants which may occur to a specialist in this field, without departing from the framework or the scope of the present invention. ##SPC1##

    __________________________________________________________________________     #             SEQUENCE LISTING                                                 - (1) GENERAL INFORMATION:                                                     -    (iii) NUMBER OF SEQUENCES: 81                                             - (2) INFORMATION FOR SEQ ID NO:1:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 291 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                  - GTGGTTACTT GTACACATGG CATCAAGCCA ACAGTAAGTA CTCAGCTAAT AT - #TAAATGGA          60                                                                           - ACACTCTCAG AAGGAAAGAT AAGAATGATG GCAAAAAATA TTTCGGATAG TG - #GCCAAAAT         120                                                                           - ATCATAGTGA CCCTAAATAC TACTATAAAC ATGACCTGCC AGAGACCAGG AC - #ATCAAACA         180                                                                           - GTACAAGAGA TAAGGATAGG TCCAATGGCC TGGTACAGCA TGGGCTTAGC GG - #CAGGAAAC         240                                                                           #            291GAAGAGC TTATTGTGAA TATAATACCA CTAATTGGAT A                     - (2) INFORMATION FOR SEQ ID NO:2:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 294 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                  - GTAGTTACTT GTACACATGG CATCAAGCCA ACAGTGAGTA CTCATCTAAT AT - #TAAATGGG          60                                                                           - ACAATCTCTG AAGGAGAAAT AAGAATTATG GGAAAAAATA TTCGGGAAAA TG - #CTAAAAAT         120                                                                           - ATCATAGTGA CCCTAAATTC TACTATAAAC ATGACCTGTG AGAGACCAGA GG - #GAAATCTG         180                                                                           - ACAATACAAG AGATACACTC AGGACCAATG GCCTGGTACA GCTTGGGACT AA - #AGAGAAAT         240                                                                           - ACAACCGTAA GATCAAGATC AGCTCATTGC AAGTATAACA CCACTAATTG GG - #AA               294                                                                           - (2) INFORMATION FOR SEQ ID NO:3:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 297 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                  - GTGGTTACTT GTACACATGG CATCAAGCCA GCAGTAAGTA CTCAGCTAAT AT - #TAAATGGG          60                                                                           - ACACTCTCTA AAGGAAAAAT AAGAATTATG GCAAAAAATA TTACAAACAC TG - #GGAATAAT         120                                                                           - ATCATAGTGA CTCTAAATTC CACCATAAAC ATAACCTGTA ACAGACCAGG AA - #GGGGAATA         180                                                                           - AAACAGATAG GTATAGGTCC AATGTCCGTA TACAGCGGGA GCTTAGCGGA CT - #TAGGGGGA         240                                                                           - AACAACAACT CAAGGATAGC TTATTGCGAT TATGACATCA CTAAGTGGAA CG - #AAACA            297                                                                           - (2) INFORMATION FOR SEQ ID NO:4:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 294 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                  - GTAGTTACTT GTACACATGG CATCAAGCCA ACAGTAAGTA CTCAATTAAT AA - #TGAATGGG          60                                                                           - ACACTCTCTA GAGGGAAGAT AAGAATTATG GGAAGAAATA TTACAGACAA TA - #CAAAGAAT         120                                                                           - ATTATAGTAA CCTTAAACAC TTCTATAAAC ATGACATGTA TGAGAAAAGG AA - #GAGGTAAA         180                                                                           - ATACAAAGGA TAGCGACAGG TCCACTGCGA TGGGTCAGTA TGGCAGCTAA AA - #CAGAGTCA         240                                                                           - CAGAACACAG GGTCAAGGAT AGCTTATTGT ATGTATAATA ACACTGAATG GA - #TA               294                                                                           - (2) INFORMATION FOR SEQ ID NO:5:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 291 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                  - GTGGTTACTT GTACACATGG CATCAAGCCA ACAGTAAGTA CTCAGCTAAT AT - #TAAATGGA          60                                                                           - ACACTCTCGA AAGGAAAGAT AAGACTGATG GCAAAAAATA TTTCGGATAG TG - #GCCAAAAT         120                                                                           - ATCATAGTGA CCCTAAATAC TACTATAAAC ATGACCTGCC ATAGACCAGG AA - #ATCTAAAA         180                                                                           - GTACAGGAGA TAAAGATAGG TCCAATGGCC TGGTACAGCA TGGGCATAGA GA - #ATGAAAAC         240                                                                           #            291GAAAAGC TTATTGTGAT TATAATACCA CTAAGTGGGT A                     - (2) INFORMATION FOR SEQ ID NO:6:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 291 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                  - GTAGTTACTT GTACACATGG CATCAAGCCC ACAGTGAGTA CTCAACTGAT AT - #TAAATGGG          60                                                                           - ACACTCTCTG AAAAGGGAAT AAGAATTATG GGAAAAAACA TTTCAAAAAC TG - #GGGAAAAT         120                                                                           - ATCATAGTGA CCCTAAATGT AAGCATAAAC ATTACTTGTC ATAGACCAGG AA - #ATCTGTCA         180                                                                           - GTACAAGAGA TGAAAATAGG TCCACTGTCC TGGTACAGCA TGGGCCTAGC GG - #CAAACTCA         240                                                                           #            291GGGTAGC TTATTGCAAT TATAGTACCA CTGAATGGAC A                     - (2) INFORMATION FOR SEQ ID NO:7:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 296 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                  - GTGGTTACTT GACACATGGC ATCAAGCCAG CAGTAAGTAC TCAACTAATA CT - #AAATGGGA          60                                                                           - CACTCTCTGA AGGGAAGATA AGAATTATGG GACAAAATAT CTCTGACAGT GG - #AAAGAATA         120                                                                           - TCATAGTAAC CCTAAATAAG ACTGTAAACA TGAACATAAC CTGCACAAGA GA - #TGGAGATC         180                                                                           - AGAAGGTACA AGAGATAGGG ATAGGTCCAC TGTCATGGTA CAGTATGAGC AT - #TGCAGAAG         240                                                                           - ACAGCGCTAA AAACACAAGA GCAGCTTATT GTAACTATAG TGCAAGTAGT TG - #GAAG             296                                                                           - (2) INFORMATION FOR SEQ ID NO:8:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 120 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                  - AGACAACTCC GAGCTCGCCT GCTAGCCTTA GAAACCTTAA TACAGAATCA GC - #AACTCCTA          60                                                                           - AACTCGTGGG GCTGTAAGGG AAGGATAGTC TGCTACACAT CAGTAAAATG GA - #ACTGGACA         120                                                                           - (2) INFORMATION FOR SEQ ID NO:9:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 120 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                  - AGACAACTCC GAGCTCGCCT GCTAGCCTTA GAAACCTTAA TACAGAATCA GC - #AACTCCTA          60                                                                           - AACCTATGGG GCTGTAAGGG AAGGCTACTC TGCTACACAT CAGTAAAATG GA - #ATACGACA         120                                                                           - (2) INFORMATION FOR SEQ ID NO:10:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 120 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                 - AGACAACTCC GAGCTCGCCT GCAAGCCTTA GAAACCTTAA TCCAGAATCA GC - #AACTCCTA          60                                                                           - AGCCTGTGGG GCTGTAAAGG AAGGCTAGTC TGCTACACAT CAGTAAAATG GC - #ACAACACA         120                                                                           - (2) INFORMATION FOR SEQ ID NO:11:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 120 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                 - AGACAACTCC GAGCTCGCCT GCAAGCCTTA GAACCCCTTA TACAGAATCA GC - #AACGCCTA          60                                                                           - AGCCTATGGG GATGTAAGGG AAGGATAATA TGTTACACAT CAGCAAAATG GA - #ACAACACA         120                                                                           - (2) INFORMATION FOR SEQ ID NO:12:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 120 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                 - AGACAACTCC GAGCTCGCCT GCTAGCCTTA GAAACCTTAA TACAGAATCA GC - #AACTCCTA          60                                                                           - AACTCATGGG GCTGTAAGGG AAGGCTAGTC TGTTACACAT CAGTAAAATG GA - #ACGAGACA         120                                                                           - (2) INFORMATION FOR SEQ ID NO:13:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 120 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                 - AGACAACTCC GAGCTCGCCT GCTAGCCTTA GAAACCTTGA TACAGAATCA GC - #AACTCCTA          60                                                                           - AACCTATGGG GCTGTAAGGG AAGGCTACTC TGCTACACAT CAGTAAAATG GA - #ACAGTACA         120                                                                           - (2) INFORMATION FOR SEQ ID NO:14:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 120 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                 - AGACAACTCC GAGCTCGCCT GGTTGCCTTA GAAACCCTTG TACAGAATCA GC - #AACTCCTA          60                                                                           - AACCTATGGG GCTGTAAAGG AAGACTAACA TGCTATACAT CAGTAAAATG GA - #ATGACACA         120                                                                           - (2) INFORMATION FOR SEQ ID NO:15:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 399 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                 - CCCATTTCTC CTAGAACTTT AAATGCATGG GTAAAGGCAG TAGAAGAGAA AG - #CCTTTAAC          60                                                                           - CCTGAAATCA TTCCTATGTT CATGGCATTG TCAGAGGGAG CTGTTCCCTA TG - #ATATTAAT         120                                                                           - ACTATGCTAA ATGCCATAGG AGAACATCAA GGGGCTTTAC AAGTGCTAAA GG - #AAGTAATC         180                                                                           - AATGAGGAAG CATTGGAGTG GGATAGAACT CACCCACCAC CGATAGGGCC GT - #TACCACCA         240                                                                           - GGGCAGATAA GGGACCCAAC AGGAAGTGAC ATTGCTGGAA CAACTAGCAC TC - #AGCAAGAG         300                                                                           - CAAGTTCACT GGGTGACCAG GAACCCCAAC CCTATCCCAG TAGGAGACAT CT - #ATTGGAAA         360                                                                           #   399            TTAA CAAATTGGTT AAAATGTAC                                   - (2) INFORMATION FOR SEQ ID NO:16:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 399 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                 - CCCCTCTCCC CCAGGACTTT AAATGCATGG GTAAAGGCAG TAGAAGAAAA AG - #CCTTTAAC          60                                                                           - CCTGAGATCA TTCCTATGTT CATGGCATTG TCAGAGGGAG CTATTCCCTA TG - #ATATTAAT         120                                                                           - ACTATGCTAA ATGCCATAGG AGGACATCAA GGAGCCCTAC AAGTGCTAAA GG - #AAGTAATC         180                                                                           - AATGAGGAAG CAGCAGATTG GGATAGAACT CACCCGCCAC CGATAGGGCC AT - #TACCACCA         240                                                                           - GGGCAGATAA GGGAACCAAC AGGAAGTGAC ATTGCTGGGA CAACTAGCAC CC - #AGCAAGAG         300                                                                           - CAAGTTCACT GGATTACCAG AGCCAACCAA TCTATCCCAG TAGGAGACAT CT - #ATAGAAAA         360                                                                           #   399            TAAA CAAAATGGTA AAAATGTAC                                   - (2) INFORMATION FOR SEQ ID NO:17:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 399 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                 - CCCCTCTCCC CCAGGACTCT AAATGCATGG GTAAAGGCAG TAGAAGAAAA AG - #CCTTTAAC          60                                                                           - CCTGAAATCA TTCCTATGTT CATGGCATTG TCAGAGGGAG CAATTCCCTA TG - #ATATTAAT         120                                                                           - ACTATGCTAA ATGCCATAGG AGGACATCAA GGAGCTTTAC AAGTGTTAAA GG - #AAGTAATC         180                                                                           - AATGAGGAAG CATCAGATTG GGATAGAACT CACCCACCAC CGATAGGGCC GC - #TGCCTCCA         240                                                                           - GGGCAAATAA GGGAACCAAC AGGAAGTGAC ATTGCTGGGA CAACTAGTAC CC - #AGCAAGAG         300                                                                           - CAAGTTCACT GGACTACCAG ACCCAATCAA CCTATCCCAG TAGGAGACAT CT - #ATAGAAAA         360                                                                           #   399            TAAA CAAAATGGTA AAAATGTAC                                   - (2) INFORMATION FOR SEQ ID NO:18:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 399 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                 - GCCCTCTCCC CCAGGACGTT AAATGCATGG GTAAAGGCAG TAGAAGAAAA GG - #CCTTTAAC          60                                                                           - CCTGAAATTA TTCCTATGTT TATGGCATTA TCAGAAGGAG CTGTTCCCTA TG - #ATATCAAT         120                                                                           - ACCATGCTAA ATGCCATAGG AGGACACCAA GGGGCTTTAC AAGTGTTGAA GG - #AAGTAATC         180                                                                           - AATGAGGAAG CAGCAGAATG GGATAGAACT CATCCACCAG CAATGGGGCC GT - #TACCACCA         240                                                                           - GGGCAGCTAA GAGATCCAAC AGGAAGTGAC ATTGCTGGAA CAACTAGCAC AC - #AGCAAGAG         300                                                                           - CAAATTAACT GGATTACTAG ACCAAATAAC CCTGTCCCTG TAGGAGACAT CT - #ATAGAAAA         360                                                                           #   399            TAAA TAAAATGGTA AAGTTGTAC                                   - (2) INFORMATION FOR SEQ ID NO:19:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 399 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                 - GCCCTTTCCC CTAGAACTTT AAATGCATGG GTAAAGGCAG TAGAAGAAAA AG - #CCTTTAAC          60                                                                           - CCTGAAATCA TTCCTATGTT CATGGCATTG TCAGAGGGAG CTATTTCCTA TG - #ACATTAAT         120                                                                           - ACTATGCTAA ATGCCATAGG AGGACATCAA GGGGCTTTAC AAGTGCTAAA GG - #AAGTAATC         180                                                                           - AATGAGGAAG CAGCAGAGTG GGATAGAACT CACCCAATAC CGGTAGGGCC GT - #TACCACCA         240                                                                           - GGGCAGATAA GGGACCCAAC AGGAAGTGAC ATTGCTGGGA CAACTAGCAC CC - #AGCAAGAA         300                                                                           - CAAGTTCACT GGACAACCAG ACCCAACAAC CCTATCCCAG TAGGAGACAT CT - #ATAGGAAA         360                                                                           #   399            TTAA CAAAATGGTA AAAATGTAC                                   - (2) INFORMATION FOR SEQ ID NO:20:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 399 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                 - GCTATCTCCC CCAGGACTTT AAATGCATGG GTAAAGGCAG TAGAAGAGAA GG - #CCTTTAAC          60                                                                           - CCTGAAATCA TTCCTATGTT CATGGCATTG TCAGAGGGAG CTATTCCCTA CG - #ATATTAAT         120                                                                           - ACCATGCTAA ATGCCATAGG AGGACATCAA GGAGCCTTGC AGGTGCTAAA GG - #AAGTAATC         180                                                                           - AATGATGAAG CAGCAGATTG GGATAGAACT CACACACCAC CGGTAGGGCC GT - #TGCCACCA         240                                                                           - GGGCAGATAA GGGAACCAAC AGGAAGTGAC ATTGCTGGGA CAACTAGCAC CC - #AGCAAGAG         300                                                                           - CAAGTTCATT GGATTACTAG GCCCAACAAC CCTATCCCAG TAGGAGACAT CT - #ATAGAAAA         360                                                                           #   399            TAAA CAAAATGGTA AAAATGTAC                                   - (2) INFORMATION FOR SEQ ID NO:21:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 399 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                 - GCCCTCTCCC CCAGGACTTT AAATGCATGG GTAATAGCAG TAGAAGAGAA AG - #CCTTTAAC          60                                                                           - CCTGAAATTA TTCCTATGTT TATGGCATTA TCAGAAGGAG CTGTTCCCTA TG - #ATATCAAT         120                                                                           - ACCATGCTAA ATGCCATAGG AGGACACCAG GGGGCTTTAC AAGTGTTGAA GG - #AAGTGATC         180                                                                           - AATGAAGAAG CAGCAGATTG GGACAGAACT CATCCACCAC CAGTAGGGCC GT - #TACCACCA         240                                                                           - GGTCAGATAA GGGAACCAAC AGGGAGTGAT ATTGCTGGAA CCACTAGCAC AC - #AGCAAGAG         300                                                                           - CAAATTCACT GGATTACTAG GGGAGGTAAT TCTATCCCAG TAGGAGACAT AT - #ATAGGAAA         360                                                                           #   399            TAAA CAAAATGGTA AAAATGTAC                                   - (2) INFORMATION FOR SEQ ID NO:22:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 22 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   #= "PRIMER"A) DESCRIPTION: /desc                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                 #                 22AGG AT                                                     - (2) INFORMATION FOR SEQ ID NO:23:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 24 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   #= "PRIMER"A) DESCRIPTION: /desc                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                 #                24AGCA CTAT                                                   - (2) INFORMATION FOR SEQ ID NO:24:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 21 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   #= "PRIMER"A) DESCRIPTION: /desc                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                                 #21                TGGT A                                                      - (2) INFORMATION FOR SEQ ID NO:25:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 21 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   #= "PRIMER"A) DESCRIPTION: /desc                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:                                 #21                TAAG T                                                      - (2) INFORMATION FOR SEQ ID NO:26:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 97 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                                 - Val Val Thr Cys Thr His Gly Ile Lys Pro Th - #r Val Ser Thr Gln Leu          #                15                                                            - Ile Leu Asn Gly Thr Leu Ser Glu Gly Lys Il - #e Arg Met Met Ala Lys          #            30                                                                - Asn Ile Ser Asp Ser Gly Gln Asn Ile Ile Va - #l Thr Leu Asn Thr Thr          #        45                                                                    - Ile Asn Met Thr Cys Gln Arg Pro Gly His Gl - #n Thr Val Gln Glu Ile          #    60                                                                        - Arg Ile Gly Pro Met Ala Trp Tyr Ser Met Gl - #y Leu Ala Asn Gly Asn          #80                                                                            - Gly Ser Glu Ser Arg Arg Ala Tyr Cys Glu Ty - #r Asn Thr Thr Asn Trp          #                95                                                            - Ile                                                                          - (2) INFORMATION FOR SEQ ID NO:27:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 97 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:                                 - Val Val Thr Cys Thr His Gly Ile Lys Pro Th - #r Val Ser Thr Gln Leu          #                15                                                            - Ile Leu Asn Gly Thr Leu Ser Glu Lys Gly Il - #e Arg Ile Met Gly Lys          #            30                                                                - Asn Ile Ser Lys Thr Gly Glu Asn Ile Ile Va - #l Thr Leu Asn Val Ser          #        45                                                                    - Ile Asn Ile Thr Cys His Arg Pro Gly Asn Le - #u Ser Val Gln Glu Met          #    60                                                                        - Lys Ile Gly Pro Leu Ser Trp Tyr Ser Met Gl - #y Leu Ala Ala Asn Ser          #80                                                                            - Ser Ile Lys Ser Arg Val Ala Tyr Cys Asn Ty - #r Ser Thr Thr Glu Trp          #                95                                                            - Thr                                                                          - (2) INFORMATION FOR SEQ ID NO:28:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 25 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:                                 - Arg Ser Val Gln Glu Met Lys Ile Gly Pro Le - #u Ser Trp Tyr Ser Met          #                15                                                            - Gly Leu Ala Ala Asn Ser Ser Ile Lys                                          #            25                                                                - (2) INFORMATION FOR SEQ ID NO:29:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 98 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:                                 - Val Val Thr Cys Thr His Gly Ile Lys Pro Th - #r Val Ser Thr Gln Leu          #                15                                                            - Ile Met Asn Gly Thr Leu Ser Arg Gly Lys Il - #e Arg Ile Met Gly Arg          #            30                                                                - Asn Ile Thr Asp Asn Thr Lys Asn Ile Ile Va - #l Thr Leu Asn Thr Ser          #        45                                                                    - Ile Asn Met Thr Cys Met Arg Lys Gly Arg Gl - #y Lys Ile Gln Arg Ile          #    60                                                                        - Ala Thr Gly Pro Leu Arg Trp Val Ser Met Al - #a Ala Lys Thr Glu Ser          #80                                                                            - Gln Asn Thr Gly Ser Arg Ile Ala Tyr Cys Me - #t Tyr Asn Asn Thr Glu          #                95                                                            - Trp Ile                                                                      - (2) INFORMATION FOR SEQ ID NO:30:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 97 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:                                 - Val Val Thr Cys Thr His Gly Ile Lys Pro Th - #r Val Ser Thr Gln Leu          #                15                                                            - Ile Leu Asn Gly Thr Leu Ser Lys Gly Lys Il - #e Arg Leu Met Ala Lys          #            30                                                                - Asn Ile Ser Asp Ser Gly Gln Asn Ile Ile Va - #l Thr Leu Asn Thr Thr          #        45                                                                    - Ile Asn Met Thr Cys His Arg Pro Gly Asn Le - #u Lys Val Gln Glu Ile          #    60                                                                        - Lys Ile Gly Pro Met Ala Trp Tyr Ser Met Gl - #y Ile Glu Ala Glu Asn          #80                                                                            - Ile Pro Asp Ser Arg Lys Ala Tyr Cys Asp Ty - #r Asn Ala Thr Lys Trp          #                95                                                            - Val                                                                          - (2) INFORMATION FOR SEQ ID NO:31:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 99 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:                                 - Val Val Thr Cys Thr His Gly Ile Lys Pro Al - #a Val Ser Thr Gln Leu          #                15                                                            - Ile Leu Asn Gly Thr Leu Ser Glu Gly Lys Il - #e Arg Ile Met Gly Gln          #            30                                                                - Asn Ile Ser Asp Ser Gly Lys Asn Ile Ile Va - #l Thr Leu Asn Lys Thr          #        45                                                                    - Val Asn Met Asn Ile Thr Cys Thr Arg Asp Gl - #y Asp Gln Lys Val Gln          #    60                                                                        - Glu Ile Gly Ile Gly Pro Leu Ser Trp Tyr Se - #r Met Ser Ile Ala Glu          #80                                                                            - Asp Ser Ala Lys Asn Thr Arg Ala Ala Tyr Cy - #s Asn Tyr Ser Ala Ser          #                95                                                            - Ser Trp Lys                                                                  - (2) INFORMATION FOR SEQ ID NO:32:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 98 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:                                 - Val Val Thr Cys Thr His Gly Ile Lys Pro Th - #r Val Ser Thr His Leu          #                15                                                            - Ile Leu Asn Gly Thr Ile Ser Glu Gly Glu Il - #e Arg Ile Met Gly Lys          #            30                                                                - Asn Ile Arg Glu Asn Ala Lys Asn Ile Ile Va - #l Thr Leu Asn Ser Thr          #        45                                                                    - Ile Asn Met Thr Cys Glu Arg Pro Glu Gly As - #n Leu Thr Ile Gln Glu          #    60                                                                        - Ile His Ser Gly Pro Met Ala Trp Tyr Ser Le - #u Gly Leu Lys Arg Asn          #80                                                                            - Thr Thr Val Arg Ser Arg Ser Ala His Cys Ly - #s Tyr Asn Thr Thr Asn          #                95                                                            - Trp Glu                                                                      - (2) INFORMATION FOR SEQ ID NO:33:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 25 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:                                 - Arg Thr Ile Gln Glu Ile His Ser Gly Pro Me - #t Ala Trp Tyr Ser Leu          #                15                                                            - Gly Leu Lys Arg Asn Thr Thr Val Arg                                          #            25                                                                - (2) INFORMATION FOR SEQ ID NO:34:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 99 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:                                 - Val Val Thr Cys Thr His Gly Ile Lys Pro Al - #a Val Ser Thr Gln Leu          #                15                                                            - Ile Leu Asn Gly Thr Leu Ser Lys Gly Lys Il - #e Arg Ile Met Ala Lys          #            30                                                                - Asn Ile Thr Asn Thr Gly Asn Asn Ile Ile Va - #l Thr Leu Asn Ser Thr          #        45                                                                    - Ile Asn Ile Thr Cys Asn Arg Pro Gly Arg Gl - #y Ile Lys Gln Ile Gly          #    60                                                                        - Ile Gly Pro Met Ser Val Tyr Ser Gly Ser Le - #u Ala Asp Leu Gly Gly          #80                                                                            - Asn Asn Asn Ser Arg Ile Ala Tyr Cys Asp Ty - #r Asp Ile Thr Lys Trp          #                95                                                            - Asn Glu Thr                                                                  - (2) INFORMATION FOR SEQ ID NO:35:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 24 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:                                 - Arg Ile Lys Gln Ile Gly Ile Gly Pro Met Se - #r Val Tyr Ser Gly Ser          #                15                                                            - Leu Ala Asp Leu Gly Asn Asn Asn                                                          20                                                                 - (2) INFORMATION FOR SEQ ID NO:36:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 40 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:                                 - Arg Gln Leu Arg Ala Arg Leu Leu Ala Leu Gl - #u Thr Leu Ile Gln Asn          #                15                                                            - Gln Gln Leu Leu Asn Ser Trp Gly Cys Lys Gl - #y Arg Ile Val Cys Tyr          #            30                                                                - Thr Ser Val Lys Trp Asn Trp Thr                                              #        40                                                                    - (2) INFORMATION FOR SEQ ID NO:37:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 40 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:                                 - Arg Gln Leu Arg Ala Arg Leu Leu Ala Leu Gl - #u Thr Leu Ile Gln Asn          #                15                                                            - Gln Gln Leu Leu Asn Leu Trp Gly Cys Lys Gl - #y Arg Leu Leu Cys Tyr          #            30                                                                - Thr Ser Val Lys Trp Asn Ser Thr                                              #        40                                                                    - (2) INFORMATION FOR SEQ ID NO:38:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 40 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:                                 - Arg Gln Leu Arg Ala Arg Leu Gln Ala Leu Gl - #u Pro Leu Ile Gln Asn          #                15                                                            - Gln Gln Arg Leu Ser Leu Trp Gly Cys Lys Gl - #y Arg Ile Ile Cys Tyr          #            30                                                                - Thr Ser Ala Lys Trp Asn Asn Thr                                              #        40                                                                    - (2) INFORMATION FOR SEQ ID NO:39:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 40 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:                                 - Arg Gln Leu Arg Ala Arg Leu Leu Ala Leu Gl - #u Thr Leu Ile Gln Asn          #                15                                                            - Gln Gln Leu Leu Asn Ser Trp Gly Cys Lys Gl - #y Arg Leu Val Cys Tyr          #            30                                                                - Thr Ser Val Lys Trp Asn Glu Thr                                              #        40                                                                    - (2) INFORMATION FOR SEQ ID NO:40:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 40 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:                                 - Arg Gln Leu Arg Ala Arg Leu Leu Ala Leu Gl - #u Thr Leu Ile Gln Asn          #                15                                                            - Gln Gln Leu Leu Asn Leu Trp Gly Cys Lys Gl - #y Arg Leu Leu Cys Tyr          #            30                                                                - Thr Ser Val Lys Trp Asn Thr Thr                                              #        40                                                                    - (2) INFORMATION FOR SEQ ID NO:41:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 40 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:                                 - Arg Gln Leu Arg Ala Arg Leu Gln Ala Leu Gl - #u Thr Leu Ile Gln Asn          #                15                                                            - Gln Gln Leu Leu Ser Leu Trp Gly Cys Lys Gl - #y Arg Leu Val Cys Tyr          #            30                                                                - Thr Ser Val Lys Trp His Asn Thr                                              #        40                                                                    - (2) INFORMATION FOR SEQ ID NO:42:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 40 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:                                 - Arg Gln Leu Arg Ala Arg Leu Val Ala Leu Gl - #u Thr Leu Val Gln Asn          #                15                                                            - Gln Gln Leu Leu Asn Leu Trp Gly Cys Lys Gl - #y Arg Leu Thr Cys Tyr          #            30                                                                - Thr Ser Val Lys Trp Asn Asp Thr                                              #        40                                                                    - (2) INFORMATION FOR SEQ ID NO:43:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 133 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:                                 - Pro Ile Ser Pro Arg Thr Leu Asn Ala Trp Va - #l Lys Ala Val Glu Glu          #                15                                                            - Lys Ala Phe Asn Pro Glu Ile Ile Pro Met Ph - #e Met Ala Leu Ser Glu          #            30                                                                - Gly Ala Val Pro Tyr Asp Ile Asn Thr Met Le - #u Asn Ala Ile Gly Glu          #        45                                                                    - His Gln Gly Ala Leu Gln Val Leu Lys Glu Va - #l Ile Asn Glu Glu Ala          #    60                                                                        - Leu Glu Trp Asp Arg Thr His Pro Pro Pro Il - #e Gly Pro Leu Pro Pro          #80                                                                            - Gly Gln Ile Arg Asp Pro Thr Gly Ser Asp Il - #e Ala Gly Thr Thr Ser          #                95                                                            - Thr Gln Gln Glu Gln Val His Trp Val Thr Ar - #g Asn Pro Asn Pro Ile          #           110                                                                - Pro Val Gly Asp Ile Tyr Trp Lys Trp Ile Va - #l Phe Gly Leu Asn Lys          #       125                                                                    - Leu Val Lys Met Tyr                                                              130                                                                        - (2) INFORMATION FOR SEQ ID NO:44:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 133 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:                                 - Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Va - #l Lys Ala Val Glu Glu          #                15                                                            - Lys Ala Phe Asn Pro Glu Ile Ile Pro Met Ph - #e Met Ala Leu Ser Glu          #            30                                                                - Gly Ala Ile Pro Tyr Asp Ile Asn Thr Met Le - #u Asn Ala Ile Gly Gly          #        45                                                                    - His Gln Gly Ala Leu Gln Val Leu Lys Glu Va - #l Ile Asn Asp Glu Ala          #    60                                                                        - Ala Asp Trp Asp Arg Thr His Thr Pro Pro Va - #l Gly Pro Leu Pro Pro          #80                                                                            - Gly Gln Ile Arg Glu Pro Thr Gly Ser Asp Il - #e Ala Gly Thr Thr Ser          #                95                                                            - Thr Gln Gln Glu Gln Val His Trp Ile Thr Ar - #g Pro Asn Asn Pro Ile          #           110                                                                - Pro Val Gly Asp Ile Tyr Arg Lys Trp Ile Va - #l Leu Gly Leu Asn Lys          #       125                                                                    - Met Val Lys Met Tyr                                                              130                                                                        - (2) INFORMATION FOR SEQ ID NO:45:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 133 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:                                 - Ala Leu Ser Pro Arg Thr Leu Asn Ala Trp Va - #l Lys Ala Val Glu Glu          #                15                                                            - Lys Ala Phe Asn Pro Glu Ile Ile Pro Met Ph - #e Met Ala Leu Ser Glu          #            30                                                                - Gly Ala Val Pro Tyr Asp Ile Asn Thr Met Le - #u Asn Ala Ile Gly Gly          #        45                                                                    - His Gln Gly Ala Leu Gln Val Leu Lys Glu Va - #l Ile Asn Glu Glu Ala          #    60                                                                        - Ala Glu Trp Asp Arg Thr His Pro Pro Ala Me - #t Gly Pro Leu Pro Pro          #80                                                                            - Gly Gln Ile Arg Asp Pro Thr Gly Ser Asp Il - #e Ala Gly Thr Thr Ser          #                95                                                            - Thr Gln Gln Glu Gln Ile Asn Trp Ile Thr Ar - #g Pro Asn Asn Pro Val          #           110                                                                - Pro Val Gly Asp Ile Tyr Arg Lys Trp Ile Va - #l Leu Gly Leu Asn Lys          #       125                                                                    - Met Val Lys Leu Tyr                                                              130                                                                        - (2) INFORMATION FOR SEQ ID NO:46:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 133 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:                                 - Ala Leu Ser Pro Arg Thr Leu Asn Ala Trp Va - #l Lys Ala Val Glu Glu          #                15                                                            - Lys Ala Phe Asn Pro Glu Ile Ile Pro Met Ph - #e Met Ala Leu Ser Glu          #            30                                                                - Gly Ala Ile Ser Tyr Asp Ile Asn Thr Met Le - #u Asn Ala Ile Gly Gly          #        45                                                                    - His Gln Gly Ala Leu Gln Val Leu Lys Glu Va - #l Ile Asn Glu Glu Ala          #    60                                                                        - Ala Glu Trp Asp Arg Thr His Pro Ile Pro Va - #l Gly Pro Leu Pro Pro          #80                                                                            - Gly Gln Ile Arg Asp Pro Thr Gly Ser Asp Il - #e Ala Gly Thr Thr Ser          #                95                                                            - Thr Gln Gln Glu Gln Val His Trp Thr Thr Ar - #g Pro Asn Asn Pro Ile          #           110                                                                - Pro Val Gly Asp Ile Tyr Arg Lys Trp Ile Va - #l Leu Gly Leu Asn Lys          #       125                                                                    - Met Val Lys Met Tyr                                                              130                                                                        - (2) INFORMATION FOR SEQ ID NO:47:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 133 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:                                 - Ala Leu Ser Pro Arg Thr Leu Asn Ala Trp Va - #l Ile Ala Val Glu Glu          #                15                                                            - Lys Ala Phe Asn Pro Glu Ile Ile Pro Met Ph - #e Met Ala Leu Ser Glu          #            30                                                                - Gly Ala Val Pro Tyr Asp Ile Asn Thr Met Le - #u Asn Ala Ile Gly Gly          #        45                                                                    - His Gln Gly Ala Leu Gln Val Leu Lys Glu Va - #l Ile Asn Glu Glu Ala          #    60                                                                        - Ala Asp Trp Asp Arg Thr His Pro Pro Pro Va - #l Gly Pro Leu Pro Pro          #80                                                                            - Gly Gln Ile Arg Glu Pro Thr Gly Ser Asp Il - #e Ala Gly Thr Thr Ser          #                95                                                            - Thr Gln Gln Glu Gln Ile His Trp Ile Thr Ar - #g Gly Gly Asn Ser Ile          #           110                                                                - Pro Val Gly Asp Ile Tyr Arg Lys Trp Ile Va - #l Leu Gly Leu Asn Lys          #       125                                                                    - Met Val Lys Met Tyr                                                              130                                                                        - (2) INFORMATION FOR SEQ ID NO:48:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 133 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:                                 - Pro Leu Ser Pro Arg Thr Leu Asn Ala Trp Va - #l Lys Ala Val Glu Glu          #                15                                                            - Lys Ala Phe Asn Pro Glu Ile Ile Pro Met Ph - #e Met Ala Leu Ser Glu          #            30                                                                - Gly Ala Ile Pro Tyr Asp Ile Asn Thr Met Le - #u Asn Ala Ile Gly Gly          #        45                                                                    - His Gln Gly Ala Leu Gln Val Leu Lys Glu Va - #l Ile Asn Glu Glu Ala          #    60                                                                        - Ala Asp Trp Asp Arg Thr His Pro Pro Pro Il - #e Gly Pro Leu Pro Pro          #80                                                                            - Gly Gln Ile Arg Glu Pro Thr Gly Ser Asp Il - #e Ala Gly Thr Thr Ser          #                95                                                            - Thr Gln Gln Glu Gln Val His Trp Ile Thr Ar - #g Ala Asn Gln Ser Ile          #           110                                                                - Pro Val Gly Asp Ile Tyr Arg Lys Trp Ile Va - #l Leu Gly Leu Asn Lys          #       125                                                                    - Met Val Lys Met Tyr                                                              130                                                                        - (2) INFORMATION FOR SEQ ID NO:49:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 133 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:                                 - Pro Leu Ser Pro Arg Thr Leu Asn Ala Trp Va - #l Lys Ala Val Glu Glu          #                15                                                            - Lys Ala Phe Asn Pro Glu Ile Ile Pro Met Ph - #e Met Ala Leu Ser Glu          #            30                                                                - Gly Ala Ile Pro Tyr Asp Ile Asn Thr Met Le - #u Asn Ala Ile Gly Gly          #        45                                                                    - His Gln Gly Ala Leu Gln Val Leu Lys Glu Va - #l Ile Asn Glu Glu Ala          #    60                                                                        - Ser Asp Trp Asp Arg Thr His Pro Pro Pro Il - #e Gly Pro Leu Pro Pro          #80                                                                            - Gly Gln Ile Arg Glu Pro Thr Gly Ser Asp Il - #e Ala Gly Thr Thr Ser          #                95                                                            - Thr Gln Gln Glu Gln Val His Trp Thr Thr Ar - #g Pro Asn Gln Pro Ile          #           110                                                                - Pro Val Gly Asp Ile Tyr Arg Lys Trp Ile Va - #l Leu Gly Leu Asn Lys          #       125                                                                    - Met Val Lys Met Tyr                                                              130                                                                        - (2) INFORMATION FOR SEQ ID NO:50:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 282 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:                                 - TGTACACATG GCATCAAACC AACAGTGAGT ACTCACCTAA TATTAAATGG GA - #CACTCTCT          60                                                                           - GAAGGAAAAA TAAGAATTAT GGGAAAAAAT ATCTCGGACA CTGGGAAAAA TA - #TCATAGTG         120                                                                           - ACCCTAAATT CTACTATAAA CATAACCTGT GTGAGACCAT GGAATCAGAC AG - #TACAAACG         180                                                                           - ATAGGAATAG GACCAATGTC CTGGCTCAGC ATGGACATAA ATGCAGATAA AA - #ACAATAAC         240                                                                           # 282              GCGA GTATAACACC ACGGATTGGG AA                               - (2) INFORMATION FOR SEQ ID NO:51:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 279 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:                                 - TGTACACATG GCATCAAGCC CACAGTGAGC ACCCACCTGA TATTAAATGG GA - #CACTCTCT          60                                                                           - GAAGGAAAAA TAAGAATTAT GGGAAAAAAC ATTTCAGATA ATGCGAAAAA TA - #TCATAGTG         120                                                                           - ACCCTAAAAC AGACTATAAG CATAACTTGT GAGAGACCAG GAAATCTTTC AG - #TACAAGAG         180                                                                           - ATAAAAATAG GTCCAATGGC CTGGTACAGC ATGGCCGTAG AGCAAGATAA GT - #CAACCTCC         240                                                                           #   279            AGTA TAATGTCACT AAGTGGAAA                                   - (2) INFORMATION FOR SEQ ID NO:52:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 282 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:                                 - TGTACACATG GCATCAAGCC AACAGTAAGT ACTCAGTTAA TATTAAATGG AA - #CACTCTCG          60                                                                           - GAAGGAAAGA TAAGAATAAT GGCAAAAGAT ATTTTAAATA GTGGCAAAAA TA - #TCATAGTG         120                                                                           - ACCCTAAATA CTACTGTAAA CATGACCTGC GTGAGACCAG GAAATATAAC AA - #TACAAACG         180                                                                           - TTAAAGATAG GTCCACTGGC CTGGTACAGC ATGGACATAG CGAATGAAAA AG - #ACCATAAG         240                                                                           # 282              GTGA GTATAATACC ACTAATTGGG TA                               - (2) INFORMATION FOR SEQ ID NO:53:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 279 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:                                 - TGTACACATG GCATCAAGCC AACAGTAAGT ACTCAGCTAA TATTAAATAG AA - #CACTCTCG          60                                                                           - GAAGGAAAGA TAAAAATAAT GACAAAAAAT ATTTCGGAGA ATGGAAATAT TA - #TAGTGACC         120                                                                           - CTAAATACTA CTATAAACAT GACCTGCGAG AGACCAGGAA ATCTATCAGT AC - #AAGAGATA         180                                                                           - AACATAGGTC CACTGGCCTG GTACAGCATG AGCATAAAGA ATGAAGGAAA AA - #CTGAGTCA         240                                                                           #   279            AGTA TAACAGCACT AATTGGGTA                                   - (2) INFORMATION FOR SEQ ID NO:54:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 120 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:54:                                 - AGACAACTCC GAGCTCGCCT GCTAGCCTTA GAAACCTTAA TACAGAATCA GC - #AACTCCTA          60                                                                           - AACCTATGGG GCTGTAAGGG AAGGCTGGTC TGTTACACAT CAGTAAAATG GA - #ACATGTCA         120                                                                           - (2) INFORMATION FOR SEQ ID NO:55:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 120 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:                                 - AGACAACTCC GAGCTCGCCT GCTAGCCTTA GAAACCTTAA TACAGAATCA GC - #AACTCCTA          60                                                                           - AACCTATGGG GCTGTAAGGG AAGACTAATC TGCTACACAT CAGTAAAATG GA - #ACTCGACA         120                                                                           - (2) INFORMATION FOR SEQ ID NO:56:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 119 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:                                 - AGACAACTCC GAGCTCGCCT GCTAGCCTTA GAAACCTTAA TACAGAATCA GC - #AACTCCTA          60                                                                           - AACTCGTGGG GCTGTTGGGA AGACTAGTCT GTTACACATC AGTAGAATGG AA - #CTGGACA          119                                                                           - (2) INFORMATION FOR SEQ ID NO:57:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 120 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:                                 - AGACAACTCC GAGCTCGCCT GCTAGCCTTA GAAACCTTAA TTCAGAATCA GC - #AACTCCTA          60                                                                           - AACTCGTGGG GCTGTAAGGG AAGACAAGTC TGTTACACAT CAGTAAAATG GA - #ACAATACA         120                                                                           - (2) INFORMATION FOR SEQ ID NO:58:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 399 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:                                 - GCCCTCTCCC CCAGGACTTT AAATGCATGG GTAAAGGCAG TAGAAGAAAA GG - #CCTTTAAC          60                                                                           - CCGGAAATCA TTCCTATGTT CATGGCATTG TCAGAGGGAG CTGTTCCCTA TG - #ATATTAAT         120                                                                           - ACTATGCTAA ATGCCATAGG AGGACATCAA GGAGCATTAC AAGTGCTAAA AG - #AAGTAATC         180                                                                           - AATGAGGAAG CAGCAGAGTG GGATAGAACT CACCCACAAG CAGTAGGGCC AT - #TGCCACCA         240                                                                           - GGACAGATAA GGGAACCAAC AGGAAGTGAC ATTGCTGGAA CAACCAGTAC CC - #AGCAAGAG         300                                                                           - CAAATTCACT GGACTACCAG GGCCAACCCC CCTATCCCAG TAGGAGACAT CT - #ATAGAAAA         360                                                                           #   399            TAAA CAAAATGGTA AAAATGTAC                                   - (2) INFORMATION FOR SEQ ID NO:59:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 399 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:59:                                 - GCCCTCTCCC CCAGGACTTT AAATGCATGG GTAAAGGCAG TAGAAGAAAA GG - #CCTTTAAC          60                                                                           - CCTGAAATCA TTCCTATGTT CATGGCATTA TCAGAGGGAG CTATTTCCTA TG - #ATATTAAT         120                                                                           - ACCATGCTAA ATGCCATAGG AGGACATCAA GGGGCTCTAC AGGTGCTAAA GG - #AAGTAATC         180                                                                           - AATGAAGAAG CAGCAGATTG GGATAGAGCT CACCCACCAG TGGTAGGGCC GT - #TGGCACCA         240                                                                           - GGGCAGATGA GGGACCCAAC AGGAAGTGAC ATCGCTGGGA CAACTAGCAC CC - #AGCAAGAG         300                                                                           - CAAATTCATT GGACTACCAG GCCCAACAAC CCTATCCCAG TAGGAGACAT CT - #ATAGAAAA         360                                                                           #   399            TAAA CAAAATGGTA AAAATGTAC                                   - (2) INFORMATION FOR SEQ ID NO:60:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 397 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:                                 - GCCATTTCCC CTAGGACTTT AAATGCATGG GTAAAGGCAG TAGAAGAAAA AG - #CCTTTAAC          60                                                                           - CCTAAAATCA TTCCTATGTT CATGGCATTG TCAGAGGGAG CTGTTCCCTA TG - #ATATTAAT         120                                                                           - ACTATGCTAA ATGCCATAGG AGGACATCAA GGGGCTTTAC AAGTGCTAAA GG - #AAGTAATC         180                                                                           - AATGAGGAAG CATCGGAGTG GGATAGAACT CACCCACCAC CGATAGGGCC GT - #TACCACCA         240                                                                           - GGCAGATAAG GGACCCAACA GGAAGTGACA TTGCTGGACA ACTAGCACCC AG - #CAAGAGCA         300                                                                           - AGTTCACTGG ATTACCAGGG CCCCCAACCC TATCCCAGTA GGAGACATCT AT - #AGAAAATG         360                                                                           #     397          AACA AAATGGTAAA AATGTAC                                     - (2) INFORMATION FOR SEQ ID NO:61:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 399 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: DNA (genomic)                                        -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:                                 - GCCATTTCCC CTAGGACTCT AAATGCATGG GTAAAGGCAG TAGAAGAAAA GG - #CCTTTAAC          60                                                                           - CCTGAAATCA TTCCTATGTT CATGGCATTG TCAGAGGGAG CTATTCCCTA TG - #ATATTAAT         120                                                                           - ACCATGCTAA ATGCCATAGG AGGACATCAA GGGGCTTTAC AAGTGCTAAA GG - #AAGTAATC         180                                                                           - AATGAGGAAG CATCAGAATG GGATAGAACT CACCCACAAC AGGCAGGGCC GT - #TACCACCA         240                                                                           - GGGCAGATAA GGGACCCAAC AGGAAGTGAC ATTGCTGGGA CAACTAGCAC CC - #AGCAAGAG         300                                                                           - CAAGTTCACT GGACTACCAG GGCCGCCAAC CCTATCCCAG TAGGAGACAT CT - #ATAGAAAA         360                                                                           #   399            TAAT CAAAATGGTA AAAATGTAC                                   - (2) INFORMATION FOR SEQ ID NO:62:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 94 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:                                 - Cys Thr His Gly Ile Lys Pro Thr Val Ser Th - #r His Leu Ile Leu Asn          #                15                                                            - Gly Thr Leu Ser Glu Gly Lys Ile Arg Ile Me - #t Gly Lys Asn Ile Ser          #            30                                                                - Asp Thr Gly Lys Asn Ile Ile Val Thr Leu As - #n Ser Thr Ile Asn Ile          #        45                                                                    - Thr Cys Val Arg Pro Trp Asn Gln Thr Val Gl - #n Thr Ile Gly Ile Gly          #    60                                                                        - Pro Met Ser Trp Leu Ser Met Asp Ile Asn Al - #a Asp Lys Asn Asn Asn          #80                                                                            - Ser Arg Ile Ala Tyr Cys Glu Tyr Asn Thr Th - #r Asp Trp Glu                  #                90                                                            - (2) INFORMATION FOR SEQ ID NO:63:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 93 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:                                 - Cys Thr His Gly Ile Lys Pro Thr Val Ser Th - #r His Leu Ile Leu Asn          #                15                                                            - Gly Thr Leu Ser Glu Gly Lys Ile Arg Ile Me - #t Gly Lys Asn Ile Ser          #            30                                                                - Asp Asn Ala Lys Asn Ile Ile Val Thr Leu Ly - #s Gln Thr Ile Ser Ile          #        45                                                                    - Thr Cys Glu Arg Pro Gly Asn Leu Ser Val Gl - #n Glu Ile Lys Ile Gly          #    60                                                                        - Pro Met Ala Trp Tyr Ser Met Ala Val Glu Gl - #n Asp Lys Ser Thr Ser          #80                                                                            - Arg Thr Ala Tyr Cys Lys Tyr Asn Val Thr Ly - #s Trp Lys                      #                90                                                            - (2) INFORMATION FOR SEQ ID NO:64:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 94 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:                                 - Cys Thr His Gly Ile Lys Pro Thr Val Ser Th - #r Gln Leu Ile Leu Asn          #                15                                                            - Gly Thr Leu Ser Glu Gly Lys Ile Arg Ile Me - #t Ala Lys Asp Ile Leu          #            30                                                                - Asn Ser Gly Lys Asn Ile Ile Val Thr Leu As - #n Thr Thr Val Asn Met          #        45                                                                    - Thr Cys Val Arg Pro Gly Asn Ile Thr Ile Gl - #n Thr Leu Lys Ile Gly          #    60                                                                        - Pro Leu Ala Trp Tyr Ser Met Asp Ile Ala As - #n Glu Lys Asp His Lys          #80                                                                            - Ser Arg Thr Ala Tyr Cys Glu Tyr Asn Thr Th - #r Asn Trp Val                  #                90                                                            - (2) INFORMATION FOR SEQ ID NO:65:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 93 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:                                 - Cys Thr His Gly Ile Lys Pro Thr Val Ser Th - #r Gln Leu Ile Leu Asn          #                15                                                            - Arg Thr Leu Ser Glu Gly Lys Ile Lys Ile Me - #t Thr Lys Asn Ile Ser          #            30                                                                - Glu Asn Gly Asn Ile Ile Val Thr Leu Asn Th - #r Thr Ile Asn Met Thr          #        45                                                                    - Cys Glu Arg Pro Gly Asn Leu Ser Val Gln Gl - #u Ile Asn Ile Gly Pro          #    60                                                                        - Leu Ala Trp Tyr Ser Met Ser Ile Lys Asn Gl - #u Gly Lys Thr Glu Ser          #80                                                                            - Arg Val Ala Tyr Cys Glu Tyr Asn Ser Thr As - #n Trp Val                      #                90                                                            - (2) INFORMATION FOR SEQ ID NO:66:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 42 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:                                 - Arg Gln Leu Arg Ala Arg Leu Leu Ala Leu Gl - #u Thr Leu Ile Gln Asn          #                15                                                            - Gln Gln Leu Leu Asn Leu Trp Gly Cys Lys Gl - #y Arg Leu Val Cys Tyr          #            30                                                                - Thr Ser Val Lys Trp Asn Met Ser Trp Ala                                      #        40                                                                    - (2) INFORMATION FOR SEQ ID NO:67:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 41 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:67:                                 - Arg Gln Leu Arg Ala Arg Leu Leu Ala Leu Gl - #u Thr Leu Ile Gln Asn          #                15                                                            - Gln Gln Leu Leu Asn Leu Trp Gly Cys Lys Gl - #y Arg Leu Ile Cys Tyr          #            30                                                                - Thr Ser Val Lys Trp Asn Ser Thr Trp                                          #        40                                                                    - (2) INFORMATION FOR SEQ ID NO:68:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 40 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:                                 - Arg Gln Leu Arg Ala Arg Leu Leu Ala Leu Gl - #u Thr Leu Ile Gln Asn          #                15                                                            - Gln Gln Leu Leu Asn Ser Trp Gly Cys Lys Gl - #y Arg Leu Val Cys Tyr          #            30                                                                - Thr Ser Val Glu Trp Asn Trp Thr                                              #        40                                                                    - (2) INFORMATION FOR SEQ ID NO:69:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 41 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:69:                                 - Arg Gln Leu Arg Ala Arg Leu Leu Ala Leu Gl - #u Thr Leu Ile Gln Asn          #                15                                                            - Gln Gln Leu Leu Asn Ser Trp Gly Cys Lys Gl - #y Arg Gln Val Cys Tyr          #            30                                                                - Thr Ser Val Lys Trp Asn Asn Thr Trp                                          #        40                                                                    - (2) INFORMATION FOR SEQ ID NO:70:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 133 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:                                 - Ala Leu Ser Pro Arg Thr Leu Asn Ala Trp Va - #l Lys Ala Val Glu Glu          #                15                                                            - Lys Ala Phe Asn Pro Glu Ile Ile Pro Met Ph - #e Met Ala Leu Ser Glu          #            30                                                                - Gly Ala Val Pro Tyr Asp Ile Asn Thr Met Le - #u Asn Ala Ile Gly Gly          #        45                                                                    - His Gln Gly Ala Leu Gln Val Leu Lys Glu Va - #l Ile Asn Glu Glu Ala          #    60                                                                        - Ala Glu Trp Asp Arg Thr His Pro Gln Ala Va - #l Gly Pro Leu Pro Pro          #80                                                                            - Gly Gln Ile Arg Glu Pro Thr Gly Ser Asp Il - #e Ala Gly Thr Thr Ser          #                95                                                            - Thr Gln Gln Glu Gln Ile His Trp Thr Thr Ar - #g Ala Asn Pro Pro Ile          #           110                                                                - Pro Val Gly Asp Ile Tyr Arg Lys Trp Ile Va - #l Leu Gly Leu Asn Lys          #       125                                                                    - Met Val Lys Met Tyr                                                              130                                                                        - (2) INFORMATION FOR SEQ ID NO:71:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 133 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:71:                                 - Ala Leu Ser Pro Arg Thr Leu Asn Ala Trp Va - #l Lys Ala Val Glu Glu          #                15                                                            - Lys Ala Phe Asn Pro Glu Ile Ile Pro Met Ph - #e Met Ala Leu Ser Glu          #            30                                                                - Gly Ala Ile Ser Tyr Asp Ile Asn Thr Met Le - #u Asn Ala Ile Gly Gly          #        45                                                                    - His Gln Gly Ala Leu Gln Val Leu Lys Glu Va - #l Ile Asn Glu Glu Ala          #    60                                                                        - Ala Asp Trp Asp Arg Ala His Pro Pro Val Va - #l Gly Pro Leu Ala Pro          #80                                                                            - Gly Gln Met Arg Asp Pro Thr Gly Ser Asp Il - #e Ala Gly Thr Thr Ser          #                95                                                            - Thr Gln Gln Glu Gln Ile His Trp Thr Thr Ar - #g Pro Asn Asn Pro Ile          #           110                                                                - Pro Val Gly Asp Ile Tyr Arg Lys Trp Ile Va - #l Leu Gly Leu Asn Lys          #       125                                                                    - Met Val Lys Met Tyr                                                              130                                                                        - (2) INFORMATION FOR SEQ ID NO:72:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 133 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:                                 - Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Va - #l Lys Ala Val Glu Glu          #                15                                                            - Lys Ala Phe Asn Pro Glu Ile Ile Pro Met Ph - #e Met Ala Leu Ser Glu          #            30                                                                - Gly Ala Val Pro Tyr Asp Ile Asn Thr Met Le - #u Asn Ala Ile Gly Gly          #        45                                                                    - His Gln Gly Ala Leu Gln Val Leu Lys Glu Va - #l Ile Asn Glu Glu Ala          #    60                                                                        - Ser Glu Trp Asp Arg Thr His Pro Pro Pro Il - #e Gly Pro Leu Pro Pro          #80                                                                            - Gly Gln Ile Arg Asp Pro Thr Gly Ser Asp Il - #e Ala Gly Thr Thr Ser          #                95                                                            - Thr Gln Gln Glu Gln Val His Trp Ile Thr Ar - #g Ala Pro Asn Pro Ile          #           110                                                                - Pro Val Gly Asp Ile Tyr Arg Lys Trp Ile Va - #l Leu Gly Leu Asn Lys          #       125                                                                    - Met Val Lys Met Tyr                                                              130                                                                        - (2) INFORMATION FOR SEQ ID NO:73:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 133 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:73:                                 - Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Va - #l Lys Ala Val Glu Glu          #                15                                                            - Lys Ala Phe Asn Pro Glu Ile Ile Pro Met Ph - #e Met Ala Leu Ser Glu          #            30                                                                - Gly Ala Ile Pro Tyr Asp Ile Asn Thr Met Le - #u Asn Ala Ile Gly Gly          #        45                                                                    - His Gln Gly Ala Leu Gln Val Leu Lys Glu Va - #l Ile Asn Glu Glu Ala          #    60                                                                        - Ser Glu Trp Asp Arg Thr His Pro Gln Gln Al - #a Gly Pro Leu Pro Pro          #80                                                                            - Gly Gln Ile Arg Asp Pro Thr Gly Ser Asp Il - #e Ala Gly Thr Thr Ser          #                95                                                            - Thr Gln Gln Glu Gln Val His Trp Thr Thr Ar - #g Ala Ala Asn Pro Ile          #           110                                                                - Pro Val Gly Asp Ile Tyr Arg Lys Trp Ile Va - #l Leu Gly Leu Ile Lys          #       125                                                                    - Met Val Lys Met Tyr                                                              130                                                                        - (2) INFORMATION FOR SEQ ID NO:74:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 20 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   #= "PRIMER"A) DESCRIPTION: /desc                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:74:                                 # 20               ATCA                                                        - (2) INFORMATION FOR SEQ ID NO:75:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 22 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   #= "PRIMER"A) DESCRIPTION: /desc                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:75:                                 #                 22TTA AT                                                     - (2) INFORMATION FOR SEQ ID NO:76:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 22 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   #= "PRIMER"A) DESCRIPTION: /desc                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:                                 #                 22GGC AT                                                     - (2) INFORMATION FOR SEQ ID NO:77:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 26 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   #= "PRIMER"A) DESCRIPTION: /desc                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:                                 #              26  CCAT GACAGT                                                 - (2) INFORMATION FOR SEQ ID NO:78:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 21 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:                                 - Asn Thr Arg Lys Ser Ile Asn Ile Gly Pro Gl - #y Arg Ala Phe Tyr Ala          #                15                                                            - Thr Gly Glu Ile Ile                                                                      20                                                                 - (2) INFORMATION FOR SEQ ID NO:79:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 21 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:79:                                 - Asn Thr Arg Lys Gly Ile Asn Ile Gly Pro Gl - #y Arg Ala Phe Tyr Thr          #                15                                                            - Thr Gly Glu Ile Ile                                                                      20                                                                 - (2) INFORMATION FOR SEQ ID NO:80:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 25 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:80:                                 - Arg Glu Val Gln Asp Ile Tyr Thr Gly Pro Me - #t Arg Trp Arg Ser Met          #                15                                                            - Thr Leu Lys Arg Ser Asn Asn Thr Ser                                          #            25                                                                - (2) INFORMATION FOR SEQ ID NO:81:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 25 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:81:                                 - Arg Asp Ile Gln Glu Met Arg Ile Gly Pro Me - #t Ala Trp Tyr Ser Met          #                15                                                            - Gly Ile Gly Gly Thr Ala Gly Asn Ser                                          #            25                                                                __________________________________________________________________________ 

We claim:
 1. A Group O HIV-1 strain having the morphological and immunological characteristics of a retrovirus selected from the group consisting of I-1543, I-1544, I-1545, I-1546 and I-1547.
 2. The Group O HIV-1 strain comprising at least one sequence selected from the group consisting of: SEQ ID NO:1 to 7, SEQ ID NO:8 to 14 to SEQ ID NO: 15 to 21, SEQ ID NO:50 to 53, SEQ ID NO:54 to 57 and SEQ ID NO:58 to
 61. 3. The strain of claim 2, which comprises SEQ ID NO:50, SEQ ID NO:54 and SEQ ID NO:58.
 4. The strain of claim 2, which comprises SEQ ID NO:51, SEQ ID NO:55 and SEQ ID NO:59.
 5. The strain of claim 2, which comprises SEQ ID NO:52, SEQ ID NO:56 and SEQ ID NO:60.
 6. The strain of claim 2, which comprises SEQ ID NO:53, SEQ ID NO:57 and SEQ ID NO:61.
 7. The strain of claim 2, which comprises SEQ ID NO:7, SEQ ID NO:14 and SEQ ID NO:21.
 8. The strain of claim 2, which comprises SEQ ID NO:4, SEQ ID NO:11 and SEQ ID NO:18.
 9. The strain of claim 2, which comprises SEQ ID NO:1, SEQ ID NO:8 and SEQ ID NO:15.
 10. The strain of claim 2, which comprises SEQ ID NO:2, SEQ ID NO:9 and SEQ ID NO:16.
 11. The strain of claim 2, which comprises SEQ ID NO:3, SEQ ID NO:10 and SEQ ID NO:17.
 12. The strain of claim 2, which comprises SEQ ID NO:5, SEQ ID NO:12 and SEQ ID NO:19.
 13. The strain of claim 2, which comprises SEQ ID NO:6, SEQ ID NO:13 and SEQ ID NO:20.
 14. A peptide which is expressed by a Group O HIV-1 strain of claim
 1. 15. A peptide which is expressed by a Group O HIV-1 strain of claim
 2. 16. A peptide selected from the group consisting of: SEQ ID NO:26 to 35, SEQ ID NO:36 to 42, SEQ ID NO:43 to 49, SEQ ID NO:62 to 65, SEQ ID NO:66 to 69, SEQ ID NO:70 to 72 and SEQ ID NO:73.
 17. An immunogenic composition, comprising a peptide of claim
 14. 18. An immunogenic composition, comprising a peptide of claim
 15. 19. An immunogenic composition, comprising one or more products of translation of the sequences of claim
 10. 20. An antibody which binds to a peptide of claim
 14. 21. An antibody which binds to a peptide of claim
 15. 22. A method of in vitro diagnosis of a Group O HIV-1 strain, comprising: contacting a biological sample collected from a patient, with antibodies of claim 20, and detecting the immunological complexes formed between the HIV-1 antigens which may be present in the biological sample and said antibodies.
 23. A method of in vitro diagnosis of a Group O HIV-1 strain, comprising: contacting a biological sample collected from a patient, with antibodies of claim 21, and detecting the immunological complexes formed between the HIV-1 antigens which may be present in the biological sample and said antibodies.
 24. A diagnostic reagent for a Group O HIV-1, comprising at least one peptide encoded by a nucleotide selected from the group consisting ofSEQ ID NO. 26, SEQ ID NO. 27, SEQ ID NO. 28, SEQ ID NO. 29, SEQ ID NO. 30, SEQ ID NO. 31, SEQ ID NO. 32, SEQ ID NO. 33, SEQ ID NO. 34, SEQ ID NO. 35, SEQ ID NO. 62, SEQ ID NO. 63, SEQ ID NO. 64, SEQ ID NO. 65, SEQ ID NO. 36, SEQ ID NO. 37, SEQ ID NO. 38, SEQ ID NO. 39, SEQ ID NO. 40, SEQ ID NO. 41, SEQ ID NO. 42, SEQ ID NO. 66, SEQ ID NO. 67, SEQ ID NO. 68, SEQ ID NO. 69, SEQ ID NO. 43, SEQ ID NO. 44, SEQ ID NO. 45, SEQ ID NO. 46, SEQ ID NO. 47, SEQ ID NO. 48, SEQ ID NO. 49, SEQ ID NO. 70, SEQ ID NO. 71, SEQ ID NO. 72, and SEQ ID NO.
 73. 